Monocular Depth Estimation Through Virtual-World Supervision and Real-World SfM Self-Supervision
نویسندگان
چکیده
Depth information is essential for on-board perception in autonomous driving and driver assistance. Monocular depth estimation (MDE) very appealing since it allows appearance being on direct pixelwise correspondence without further calibration. Best MDE models are based Convolutional Neural Networks (CNNs) trained a supervised manner, i.e ., assuming ground truth (GT). Usually, this GT acquired at training time through calibrated multi-modal suite of sensors. However, also using only monocular system cheaper more scalable. This possible by relying structure-from-motion (SfM) principles to generate self-supervision. Nevertheless, problems camouflaged objects, visibility changes, static-camera intervals, textureless areas, scale ambiguity, diminish the usefulness such In paper, we perform xmlns:xlink="http://www.w3.org/1999/xlink">mono cular ${d}$ epth notation="LaTeX">${e}$ stimation notation="LaTeX">${v}$ irtual-world notation="LaTeX">${s}$ upervision (MonoDEVS) real-world SfM We compensate self-supervision limitations leveraging virtual-world images with accurate semantic supervision, addressing virtual-to-real domain gap. Our MonoDEVSNet outperforms previous CNNs even stereo sequences.
منابع مشابه
Aperture Supervision for Monocular Depth Estimation
We present a novel method to train machine learning algorithms to estimate scene depths from a single image, by using the information provided by a camera’s aperture as supervision. Prior works use a depth sensor’s outputs or images of the same scene from alternate viewpoints as supervision, while our method instead uses images from the same viewpoint taken with a varying camera aperture. To en...
متن کاملA World Knowledge as Indirect Supervision for Document Clustering
One of the key obstacles in making learning protocols realistic in applications is the need to supervise them, a costly process that often requires hiring domain experts. We consider the framework to use the world knowledge as indirect supervision. World knowledge is general-purpose knowledge, which is not designed for any specific domain. Then the key challenges are how to adapt the world know...
متن کاملVirtual Objects Real World
Computing at the intersection of two realms. T HE TERM VIRTUAL REALITY USUALLY REFERS TO SYSTEMS THAT generate a totally synthetic environment, in which the end user is able to specify all the characteristics of the new environment. A head-mounted display and an orientation and position tracker enable the user to roam around the new virtual world, look up and down, and walk into rooms that don'...
متن کاملMonocular Obstacle Detection for Real-World Environments
In this paper, we present a feature based approach for monocular scene reconstruction based on extended Kalman filters (EKF). Our method processes a sequence of images taken by a single camera mounted in front of a mobile robot. Using different techniques we are able to produce a precise reconstruction that is free from outliers and therefore can be used for reliable obstacle detection and avoi...
متن کاملNumerical Estimation Under Supervision
Children's number-line estimation has produced a lively debate about representational change, supported by apparently incompatible data regarding the descriptive adequacy of logarithmic (Opfer et al., 2011) and power models (Slusser et al., 2013). To test whether methodological differences might explain discrepant findings, we created a fully crossed 2x2 design and assigned 96 children to one o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Intelligent Transportation Systems
سال: 2022
ISSN: ['1558-0016', '1524-9050']
DOI: https://doi.org/10.1109/tits.2021.3117059